Linear Quadratic Mean Field Teams: Optimal and Approximately Optimal Decentralized Solutions
نویسندگان
چکیده
We consider team optimal control of decentralized systems with linear dynamics, quadratic costs, and arbitrary disturbance that consist of multiple sub-populations with exchangeable agents (i.e., exchanging two agents within the same sub-population does not affect the dynamics or the cost). Such a system is equivalent to one where the dynamics and costs are coupled across agents through the mean-field (or empirical mean) of the states and actions (even when the primitive random variables are non-exchangeable). Two information structures are investigated. In the first, all agents observe their local state and the mean-field of all sub-populations; in the second, all agents observe their local state but the mean-field of only a subset of the sub-populations. Both information structures are non-classical and not partially nested. Nonetheless, it is shown that linear control strategies are optimal for the first and approximately optimal for the second; the approximation error is inversely proportional to the size of the sub-populations whose mean-fields are not observed. The corresponding gains are determined by the solution of K+1 decoupled standard Riccati equations, where K is the number of sub-populations. The dimensions of the Riccati equations do not depend on the size of the sub-populations; thus the solution complexity is independent of the number of agents. Generalizations to major-minor agents, tracking cost, weighted mean-field, and infinite horizon are provided. The results are illustrated using an example of demand response in smart grids.
منابع مشابه
Team Optimal Decentralized Control of System with Partially Exchangeable Agents-Part 1: Linear Quadratic Mean-Field Teams
We consider team optimal control of decentralized systems with linear dynamics and quadratic costs that consist of multiple sub-populations with exchangeable agents (i.e., exchanging two agents within the same sub-population does not affect the dynamics or the cost). Such a system is equivalent to one where the dynamics and costs are coupled across agents through the mean-field (or empirical me...
متن کاملHaar Matrix Equations for Solving Time-Variant Linear-Quadratic Optimal Control Problems
In this paper, Haar wavelets are performed for solving continuous time-variant linear-quadratic optimal control problems. Firstly, using necessary conditions for optimality, the problem is changed into a two-boundary value problem (TBVP). Next, Haar wavelets are applied for converting the TBVP, as a system of differential equations, in to a system of matrix algebraic equations...
متن کاملRobust Mean Field Linear-Quadratic-Gaussian Games with Unknown L2-Disturbance
This paper considers a class of mean field linear-quadratic-Gaussian (LQG) games with model uncertainty. The drift term in the dynamics of the agents contains a common unknown function. We take a robust optimization approach where a representative agent in the limiting model views the drift uncertainty as an adversarial player. By including the mean field dynamics in an augmented state space, w...
متن کاملLqg Optimal Control Arising in Mean Field Decision Problems
Abstract. We consider a linear-quadratic-Gaussian (LQG) optimal control problem where the generalized state space is the product of an Euclidian space and an infinite dimensional function space. This model originates from a mean field LQG game with a major player and a large number of minor players, and has importance in designing decentralized strategies in the game. We show that the underlyin...
متن کاملLinear-Quadratic-Gaussian Mixed Games with Continuum-Parametrized Minor Players
Abstract. We consider a mean field linear-quadratic-Gaussian game with a major player and a large number of minor players parametrized by a continuum set. The mean field generated by the minor players is approximated by a random process depending only on the initial state and the Brownian motion of the major player, and this leads to two limiting optimal control problems with random coefficient...
متن کامل